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pages 
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XJ the claims: 
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pages . filed with the letter of ^ . 
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citations and explanations supporting such statement 



1. STATEMENT 

Novelty (N) 



Claims 7 and 1 1 



Claims 1-6 and 8-10 



Inventive Step (IS) 



Industrial Applicability (lA) 



Claims 7 and 11 



Claims 1-6 and 8-10 



YES 
NO 

YES 
NO 



Claims l-ii 
Claims 



YES 
NO 



ckavage of an intein having an anrnw^ienninal cysteine fused U) Uie target polypepude "^^^^ jde is ac^mplished in vitro by 
aSnt'tenninal cysteine. Muir et al. disclose O^at ttie fiision of .he ^^'^ ^'yj>^'^' ^^^"XTZ c^ox^terminal phenyl 

including the fused first, target, polypeptide. 

Claims 9 and 10 lac. novelty under PCT An.cle 33(2) as ^--g anticipa^d hy ^^r^^/^rerbv ^^L^^S^ l^^o/ 

^Tt^L Ld a target 'polypeptide, dtus also meeting Oie linutations of clause (c) of claun 10. 

Claitn 8 lacs novelty under PCT Article 33(2) as being anticipated by Te^ent. who 1^7 ^f^s..^;^^^^^^^ Oiat 
compHses a mutant Mycobacterium xenopi GyrA intern capable, ^^J^^'^^f P^^" ^^^^ of an 

iL^:^;-r^ra-^^^^^^ 

meeting limitations of claim 8. 

. .,r Pf-r Arrirle ^^(2V(A) because Uie prior art does not leach or fairly suggest the 
thioester of a target proiem and an amino^proximal c>stcme ot anomer ^e^on ^ J solicine or cleavage event. Claims 
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The foUowing defects in the form or contents of the international application have been noted: 

Claim 11 is objected to under PCT Rule 66.2(a)(m) as coniaimng the followmg defecKs) in the form or contents thereof: the word 
"terminal" is misspelled at line 4 of claim 11. 
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(54) Title: INTEIN MEDIATED PEPHDE LIGATION 
(57) Abstract 

In accordance with the present invention, there is provided a 
method for producing a semi-synthetic fusion protein in vitro, 
comprising the steps of producing a target protein fused to a protein 
splicine element (an intein) and selectively cleaving the fusion and 
iigating a synthetic protein or peptide at the C-termma) thioester of 
the target protein, which overcome many of the disadvantages and 
problems noted above. Specifically, the present invention has higher 
yields due to better thiol-induced cleavage with thiol-reagents which 
have been optimized for the ligation reaction, ofl-colunin ligation 
allows sample concentration and allows the use of less peptide, 
MESNA is an odorless thiol-reagent for ligation, and Mxe mtein is 
froiu a bactcnai source and often expresses better m bacTerial cells 
Furlhermore. the present invention allows peptides to be directly 
hgated to the thioester bond formed between an intern and the target 
protein The r:esen: ii-vcni^or^ ^hr■ provides a method for producing 
a cytotOMC protein, comprising the steps of producmg a truncated, 
inactive form of the piotem in vivo which is fused to a protein 
splicing element, and srieaivcl) cleaving the tuSion and iigaiing a 
svnthetic protein or peptide at a C-terminal thioester of the target 
protem to restore the aai\ii> ^'l uie nau\t: vnUjIuai^ protein 
} Recombinant vectors for producing such cleavabic fusion proteins 
] are also provided. 
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(54) Title: INTEIN MEDIATED PEPTIDE LICjATION 
(57) Abstract 

In accordance with the present invention, there is provided a method for producing a semi-synthetic fusion protein in vitro, comprising 
the steps of producing a target protein fused to a protein splicing element (an intein) and selectively cleaving the fusion and ligating a 
synthetic protein or peptide at the C-tenninal thioester of the target protein, which overcome many of the disadvantages and problems 
noted above. Specifically, the present invention has higher yields due to better thiol-induced cleavage with thiol-reagents which have been 
optimized for the ligation reaction, off-column ligation allows sample concentration and allows the use of less peptide, MESNA is an 
odorless thiol -re agent for ligation, and Mxe intein is from a bacterial source and often expresses better in bacterial cells. Furthermore, the 
present invention allows peptides to be directly Ugated to the thioester bond formed between an intein and the target protein. The present 
invention also provides a method for producing a cytotoxic protein, comprising the steps of producing a truncated, inactive form of the 
I protein in vivo which is fused to a protein splicing clement, and selectively cleaving the fusion and hgating a synthetic protein or peptide at 
C-tcrminal thioester of the target protein to restore the activity of the native cytotoxic protein. Recombinant vectors for producing such 
lea V able fusion protems are also provided. 
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INTEIN MEDIATED PFPTIDE LIGATION 



BACKGROUND OF THE INVENTION 
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Genetic engineering is a powerful approach to the 
manipulation of proteins. However, genetic methodologies are 
constrained by the use of only naturally coded amino acids. 
Furthermore, cytotoxic proteins are difficult to obtain by 
expression and isolation from a living source, since the 
expression of the toxic protein can result in death of the host. 

To some extent, protocols have been developed to 
circumvent these problems, for example, total chemical 
synthesis (Kent, S. B, (1988) Ann. Rev. Biochem. 57:957-989), 
use of misacylated tRNAs (Noren, et al., (1989) Science 
244:182-188), and semi-synthetic techniques (reviewed in 
Offord, R. (1987) Protein Eng. 1:151-157; Roy. et al. (1994) 
Methods in Enzymol. 231:194-215; Wallace, C. J. (1993) FASEB 
7:505-515). However, all of these procedures are limited by 
either the size of the fragment which can be generated or by 
low reaction yield. 

It would therefore be desirable to develop a high-yield, 
semi-synthetic technique to allow in vitro fusion of a 
synthetic protein or peptide fragment to an expressed protein 
without limitation as to the size of the fused fragments. 
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Likewise, in order to produce cytotoxic proteins, it 
would be desirable to develop a method of fusing a synthetic 
fragment, \n vitro, to an inactive, expressed protein, so as to 
restore protein activity post-production from the host. 

The modified See VMA intein has been used to generate 
thioester-tagged proteins for use in ligation (Example 19, 
U.S. S.N. 08/811,492. filed June 16, 1997; Chong, (1996) J. 
Biol. Chem., 271 (36):221 59-221 68; Chong, (1997) Gene, 
192:271-281; and Muir, et al. (1998) Proc. Natl. Acad. Sci USA 
95:6705-6710). 

Some disadvantages have been low yields due to poor 
cleavage of the See VMA intein with thiol-reagents that are 
optimum for ligation, the need for large peptide quantities due 
to on-column reactions, the use of odoriferous reagents, 
and/or low protein yields due to the use of a large, eukaryotic 
intein. 



In accordance with the present invention, there is 
provided a method lot producing a semi-synthetic fusion 
protein in vitro, comprising the steps of producing a target 
protein fused to a protein splicing element (an intein) and 
selectively cleaving the fusion and ligating a synthetic 
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protein or peptide at the C-terminal thioester of the target 
protein, which overcome many of the disadvantages and 
problems noted above. Specifically, the present invention has 
higher yields due to better thiol-induced cleavage with thiol 
5 reagents which have been optimized for the ligation reaction. 

Off-column ligation allows for sample concentration as well 
as the use of less peptide. In a particularly preferred 
embodiment, thiol reagents such as 2-mercaptoethanesulfonic 
acid (MESNA), which is an odorless thiol-reagent, is used for 

10 cleavage and ligation along with the Mxe intein, which is from 

a bacterial source and often expresses better in bacterial 
cells. Furthermore, the present invention allows peptides to 
be directly ligated to the thioester bond formed between an 
intein and the target protein. The present invention also 

15 provides a method for producing a cytotoxic protein, 

comprising the steps of producing a truncated, inactive form 
of the protein m vivo which is fused to a protein splicing 
element, and selectively cleaving the fusion and ligating a 
synthetic protein or peptide at a C-terminal thioester of the 

20 target protein to restore the activity of the native cytotoxic 

protein. Recombinant vectors for producing such cleavable 
fusion proteins are also provided. 

BRIEF DESCRIPTION OF THE DRAWINGS 



Figure 1 is a flow diagram depictmg the chemical 
reactions which enable intein-mediated peptide ligation. The 
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thioester generated at the C-terminus of the target protein 
during IMPACT™ purification was used in a 'native chemical 
ligation' reaction. This allowed the ligation of a synthetic 
peptide to a bacterially expressed protein. A typical ligation 
reaction involved the expression of the target protein-intein- 
CBD fusion followed by binding to a chitin resin. A thiol 
reagent induced cleavage of the intein. The target was eluted 
from the chitin resin and a synthetic peptide was added. The 
ligation reaction proceeded overnight. 

Figure 2 is a gel depicting the results of cleavage and 
ligation reactions using various thiols. Cleavage and ligation 
reactions with different thiols visualized on 10-20% Tricine 
gels. MYB (a fusion protein of maltose binding protein-See 
VMA intein (N454A)-chitin binding domain) and MXB (a fusion 
protein of maltose binding protein-Mxe GyrA (N198A) intein- 
chltin binding domain) were incubated overnight at 4°C with 
various thiols (50 mM) in 150 mM Tris, 100 mM NaCI, pH 8 in 
the presence of a 30 amino acid peptide with an N-terminal 
cysteine. The peptide ligates to the C-terminus of MBP. Lanes 
1-5 ligation with MYB. Lane 1 no thiol. Lane 2 dithiothreitol. 
Lane 3 2-mercaptoethanesulfonic acid. Lane 4 3- 
mercaptopropionic acid. Lane 5 tniophenol. Lanes 6-10 
ligation with MXB. Lane b no thiol. Lane 7 dithiothreitol. Lane 
8 2-mercaptoethanesulfonic acid. Lane 9 3-mercaptopropionic 
acid. Lane 10 thiophenol. 
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Figure 3 is a gel depicting direct ligation of a peptide to 
the thioester formed between the See VMA intein and maltose 
binding protein. SDS-PAGE of direct ligation reaction with a 
10-20% Tricine gel. Lane 1: a precursor protein (MYBIeu) 

5 consisting of maltose binding protein-Sce VMA1 intein-chitin 

binding domain was heated to >95°C for 5 minutes in a buffer 
of 50 mM Trizma base, pH 8.5 containing 100 mM NaCI, 1% SDS, 
and mM tris-(2-carboxyethyl)phosphine (TCEP) followed by 
overnight incubation at room temperature. The precursor 

10 (MYBIeu) is visible along with the See VMA1 intein (Y) and 

maltose binding protein (M), which are cleavage products. 
Lane 2: the precursor protein was subjected to the same 
conditions as described in Lane 1 except that the 30 amino 
acid peptide (1 mM) was added. The precursor (MYB) and 

15 cleavage products (Y and M) are visible along with the ligation 

product (M+30mer) formed when the 30 amino acid peptide 
fuses to maltose binding protein. 



Figure 4 is a diagram depicting the pTXBI expression 
20 vector of Example I (SEQ ID NO:7 and SEQ ID NO:8). 

Figure 5 is the DNA sequence of pTXBI (SEQ ID NO:5). 

Figure 6 is a gei depicting the results of the Hpa\ protein 
25 ligation reaction. Protein ligation reactions examined on 10- 

20% Tricine gels. Lane 1: clarified cells extract after IPTG 
(0.5 mM) induction of ER2566 cells containing the pTXB2-Hpa/ 
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plasmid. The fusion protein of Hpal^^^-Uxe GyrA-intein-CBD 
(52 kDa) is visible. Lane 2: cell extract as in Lane 1 after 
passage over a chitin column, which results in the binding of 
the fusion protein. Lane 3: Hpal^^^ (25.7 kDa) after cleavage 
5 from the fusion protein by addition of MESNA. Lane 4: ligation 

product of Hpa/223 (0.2 mg/mL) with 1 mM of a 31 amino acid 
peptide (ligation product 29.6 kDa), representing the residues 
necessary to generate full length Hpal, after overnight 
incubation at 4°C. Lane 5: full length Hpa\ from a recombinant 
10 source (29.6 kDa) containing BSA (66 kDa) and two impurities. 

Figure 7 is a western blot of various proteins ligated to 
a biotinylated peptide. Proteins purified with the Mxe GyrA 
IMPACTT"^ derivative were ligated to a synthetic peptide which 
15 contained an antibody recognition sequence. 

DETAILED DESCRIPTION OF THE INVENTION 

The ligation methods of the present invention are based 
20 on the discovery that a cysteine or peptide fragment 

containing an N-terminal cysteine may be fused, in vitro, to a 
bacterially expressed protein produced by thiol-induced 
cleavage of an intern (U.S. Patent No. 5,496,714, Example 19 of 
U.S. S.N. 08/811,492 filed June 16, 1997, Cheng, et al., 0996j 
25 supra and Chong, et al., (1997) supra. 
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The ligation procedure disclosed herein utilizes a 
protein splicing element, an intein (Perler, et al., (1994) 
Nucleic Acids Res. 22:1125-1127) to precisely create a 
thioester at the C-terminal a-carbon of an expressed protein. - 
5 This reactive thioester could be present between the target 

protein and intein or generated by the addition of a thiol 
reagent. Previously the generation such a thioester was 
described using an intein (CIVPS) that was modified to 
undergo thiol inducible cleavage at its N-terminal junction in 

10 the presence of thiol reagent dithiothreitol (DTT) (Chong, et 

al. (1997) supra; Comb, et.al. U.S. Patent No. 5,496,714). This 
C-terminal thioester was previously used in a 'native 
chemical ligation' type reaction (Dawson, et al., (1994) 
Science 266:776-779) to fuse 35s-cysteine or a peptide 

15 fragment containing an N-terminal cysteine to a bacterially 

expressed protein (Example 19, Comb, et.al. U.S. Patent No. 
5,834,247, Chong (1996) supra and Chong (1997) supra. 

The ligation method of the instant invention begins with 
20 the purification of the thioester-tagged target protein using 

an intein as described (Chong, et.al. (1997) supra). The direct 
ligation method of the instant invention begins with the 
isolation of a precursor composed of the target protein 
intein-CBD. In one preferred embodiment, the host cell is 
25 bacterial. In other embodiments the host cell may be yeast, 

insect, or mammalian. A cysteine thiol at the N-terminus of a 
synthetic peptide nucleophilicly attacks a thioester present 
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on the freshly isolated C-terminal a-carbon of the target 
protein or directly attacks the thioester present between the 
target protein and intein. This initially generates a thioester 
between the two reactants which spontaneously rearranges 
5 into a native peptide bond (Figure 1). 

In order to optimize the ligation efficiency so that 
greater than 90% of the bacterially expressed target protein 
can be fused to the synthetic peptide or protein, specific thiol 

10 reagents and inteins are screened. In a preferred 

embodiment, the intein may be any CIVPS, such as See VMA, 
Mxe GyrA or derivatives of mutants thereof, and the thiol 
reagent is 2-mercapto-ethanesulfonic acid, thiophenol, DTT, 
or 3-mercaptopropionic acid (Comb, et al., U.S. Patent No. 

15 5,496,714; U.S. Patent No. 5,834,247). 

In one particularly preferred embodiment, an intein 
whose protein splicing activity has been blocked by mutation 
is utilized. The mutant must, however, retain the ability to 

20 undergo the N-S shift, thus allowing thioester formation 

between itself and an N-terminal protein. This thioester can 
then be nucleophilicly attacked by a thiol reagent or by the N- 
terminal cysteine of a peptide sequence. For example, by 
mutating the C-terminal asparagine ^asn 198; of an intern 

25 from the GyrA gene of Mycobacterium xenopi (Telenti, et al., 

(1997) J Bacteriol 179:6378-6382) to an alanine created a 
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thiol inducible cleavage element. This modified intein cleaved 
well with thiol reagents that were optimal for the ligation 
reaction, such as MESNA and thiophenol. Furthermore, optimal 
thiol reagent and intein combinations can be determined by 
incubating a precursor protein containing the intein of 
interest with a wide variety of thiol reagents followed by 
determination of the extent of cleavage of the precursor 
protein (Figure 2). 

The use of such intein and specific thiol reagents leads 
to optimal yields and high ligation efficiencies; typically 
greater than 90% of the N-terminal ligation fragment can be 

modified. 

The ligation methods of the present invention expand the 
ability to incorporate non-coded amino acids into large 
protein sequences by generating a synthetic peptide fragment 
with fluorescent probes, spin labels, affinity tags, 
radiolabels, or antigenic determinants and ligating this to an 
in vivo expresed protein isolated using a modified intein. 

Furthermore, this procedure allows the isolation of 
cytotoxic proteins by purifying an mactive truncated 
precursor from a host source, for example bacteria, and 
generating an active protein or enzyme after the ligation of a 
synthetic peptide. For example, restriction endonucleases 
which have not successfully been cloned by traditional 
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methods may be produced in accordance with the present 
invention. 

Also, the direct ligation procedure allows the ligation of 
a protein or peptide sequence to another protein or peptide 
sequence without the use of exogenous thiol reagents. Direct 
ligation relies on the nucleophilic attack of the N-terminal 
amino acid of one peptide on the thioester formed between a 
target protein and an intein (Figure 3). 



In summary, a fusion protein can be created using the 
methods of the present invention that possesses unique 
properties which, currently, can not be generated genetically. 

The Examples presented below are only intended as 
specific preferred embodiments of the present invention and 
are not intended to limit the scope of the invention. The 
present invention encompasses modifications and variations 
of the methods taught herein which would be obvious to one of 
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ordinary skill in the art. 



The references cited above and below are herein 
mcorporated by reference. 
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EXAMPLE I 

Creation of vectors pTXBI and pTXB2 for ligation: 

5 Asparagine 198 of the Mxe GyrA intein (Telenti, et al., 

(1997) J Bacteriol. 179:6378-6382) was mutated to alanine by 
linker insertion into the Xmn\ and Pst\ sites of 
pmxeMIPTyrXmnSPdel to create pMXP^. The Xmn\ site was 
originally introduced into the unmodified Mxe GyrA intein 
10 sequence by silent mutagenesis. The Pst\ site was a unique 

site in the plasmid. The linker was composed of mxe#3 (5'- 
GGTTCGTCAGCGACGCTACTGGCCTCACCGGTTGATAGCTGCA-3') 
(SEQ ID N0:1) and mxe#4 (5'-GCTATCAACCGGTGAGGCCAGTAG 
CGTGGCTGACGAACC-3') (SEQ ID N0:2). 

15 

Into pMXPI another linker composed of mxe#1 (5'-TC 
GAATGTAGACATATGGGCATGGGTGGCGGCCGCCTCGAGGGCTCTTCC 

TGGATGACGGGAGATGCA-3') (SEQ ID N0:3) and mxe#2 (5'-GTAG 
TGGATGTGCCGTGATGCAGGAAGAGGCCTCGAGGGGHGGCGCGAGGGA 

20 TGGGGATATGTGTAGAT-3') (SEQ ID N0:4) was inserted into the 

Xho\ and Spel sites to introduce a multiple cloning site {Xba\- 
Nde\'Nco\-Not\-Xho\-Sap\) before the Mxe GyrA intein (pMXP2). 



2 5 



The 0.6 kilobase NoU to Age\ fragment of pMXP2 was 
ligated into the same sites in pTYB1 (IMPACT kit, New England 
Biolabs, Beverly, MA) and the Nco\ to Age\ fragment of pMXP2 
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was cloned into pTYB3 (IMPACT kit, New England Biolabs, 
Beverly. MA) to create plasmids pTXB1 (see Figure 4 and 5) 
(SEQ ID NO:5) and pTXB2, respectively. These vectors have a 
multiple cloning site upstream of the modified Mxe GyrA 
intein-chitin binding domain fusion. This allows the insertion 
of a target gene of interest inframe with the intein and chitin 
binding domain (CBD). 

Creation of vectors pMYBIeu for ligation: 

pMYBIeu was as described in Chong, et a!., (1998), J. Biol. 
Chem. 273:10567-10577. This vector consisted of maltose 
binding protein upstream of the See VMA intein-chitin binding 
domain. A leucine is present at the -1 position instead of the 
native residue (which is a glycine). 

Purification of Thioester-Tagged Proteins: 

Protein purification was as described using the See VMA 
intein (Chong, et.al., (1997) Gene 192:271-281) with slight 
modification. ER2566 cells (IMPACT T7 instruction manual 
from New England Biolabs, Beverly, MA) containing the pTXB 
vector with the appropriate msert were grown to an ODeoO of 
0.5-0.6 at 370C at which point they were induced with 0.5 mM 
IPTG overnight at 150C. Cells were harvested by 
centrifugation and lysed by sonication (performed on ice). The 
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three part fusion protein was bound to chitin beads (10 mL bed 
volume, Figure 6, lanes 1 and 2) equilibrated in Buffer A (50 
mM Tris, pH 7.4, and 500 mM NaCI), and washed with 10 
column volumes of Buffer A to remove unbound material. 

5 

Cleavage was initiated using a buffer of 50 mM 2- 
mercaptoethanesulfonic acid (MESNA), 50 mM Tris, pH 8.0 and 
100 mM NaCI. Other thiol reagents were also used at other 
times, such as thiophenol, dithiothreitol, and/or 3- 
10 mercaptopropionic acid. After overnight incubation at from 4- 

25OC protein was eluted from the column (Figure 6 lane 3). 
This protein contained a thioester at the C-terminus. 

Purification of MYB. MYBIeu and MXB: 

15 

Full length precursor proteins consisting of maltose 
binding protein-Sce VMA intein (N454A)-chitin binding domain 
(MYB) and maltose binding protein-/Wxe GyrA (N198A) intein- 
chitin binding domain (MXB) were purified after induction and 

20 sonication, as described above, by applying the sonicated 

sample to a 10 mL column of amylose resin (New England 
Biolabs, Beverly, MA). Unbound proteins were washed from the 
column with 10 column volumes of Buffer A (see purification 
of thioester-tagged proteins). Bound proteins were eluted 

25 with a buffer of 50 mM Tris, pH 8, containing 100 mM NaCI and 

10 mM maltose. Fractions were collected and protein 
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concentrations were determined using the Bio-Rad Protein 
Assay (Hercules, CA). 

Peptide Synthesis: 

Peptides for subsequent ligation reactions were 
synthesized on an ABI model 433A peptide synthesizer 
utilizing FastMoc"' chemistry (Fields, et al., (1991) Pept Res 
4, 95-101) at a 0.085 mmol scale. Preloaded HMP (p- 
hydroxymethylphenoxymethyl) polystyrene resins (Applied 
Biosystems, Foster City, CA) functionalized at 0.5 mmoi/g 
was used in conjunction with Fmoc/NMP chemistry utilizing 
HBTU amino acid activation (Dourtogiou, et al.. (1984) 
Synthesis 572-574; Knorr, et al., (1989) Tetrahedron Lett 30, 
1927-1930). Fmoc amino acids were purchased from Applied 
Biosystems (Foster City, CA). 

Synthesis proceeded with a single coupling during each 
cycle. Peptide cleavage from the resin and simultaneous 
removal of side chain protecting groups was facilitated by the 
addition of cleavage mixture (Perkin Elmer, Norwalk, CT) 
consisting of 0.75 g phenol. 0.25 mL 1 ,2-ethanedithioI, 0.5 mL 
deionized H2O, and 10 mL TFA. The resin was flushed with 
nitrogen and gently stirred at room temperature for 3 hours. 
Following filtration and precipitation into cold (OoC) methyl- 
t-butyl ether, the precipitate in the ether fraction was 



wo 00/18881 

collected by centrifugation. The peptide precipitate was 
vacuum dried and analyzed by mass spectrometry using a 
Perceptive Biosystems (Framingham, MA) MALDI-TOF mass 
spectrometer. 

Final purification was by HPLC using a Waters HPLC 
system with a Lambda-Max Model 481 Multiwavelength 
detector (set at 214 nm), 500 series pumps and automated 
gradient controller with a Vydac semi-preparative CI 8 
column. Elution of the peptide was with a 60 minute linear 
gradient of 6-60% acetonitrile (v/v) in an aqueous solution of 
0.1% TFA (v/v). 
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Protein Cleavage and Ligation Reactions: 

Cleavage of MYB and MXB: The precursor protein (1 
mg/mL) was incubated overnight at 4oC with or without a 
thiol reagent (50 mM) in 150 mM Tris, pH 8, containing 100 mM 
NaCI. 

Ligation reactions with MYB and MXB: The precursor 
protein (1 mg/mL) was treated as described for cleavage 
except that a 30 amino acid peptide (1 mM final concentration, 
NH2-CAYKTTQANKHIiVACEGNPYVPVHFDASV-C00H (SEQ ID NO:6) 
was also included in the reaction (Figure 2). 




PCT/US99/22776 



Ligation reactions after purification of thioester-tagged 
nroteins: LvoDhilized peptides (New England Biolabs, Beverly, 
MA) were added (to 1 nnM final concentration) directly to the 
thioester-tagged protein freshly isolated from the chitin 

5 column. The reaction was allowed to proceed overnight at 

from 4-250C. In both ligation procedures the condensation of 
the reactants is visible on a 10-20% Tricine gel (Figure 6). 
The ligation reaction was tested in conditions of 5-150 mM 
Tris or HEPES buffers, 50-1000 mM NaCI, 10 mM Maltose, and 

10 pH 6-11 and 0-6 M Urea. 

Direct Ligation Reactions: 

MYBIeu (1 mg/mL) was incubated in 6 M Urea or 1% SDS, 
15 pH 7.5-8.5, 50-200 mM NaCI, and 1 mM of a 30 amino acid 

peptide (NH2CAYKTTQANKHIVVACEGNPYVPVHFDASV-COOH (SEQ 
ID NO:6)). The MYBIeu was incubated for 0-180 minutes at 
either 4°C or 100°C prior to the addition of the 30 amino acid 
peptide. Ligation reactions proceeded overnight at either 4=C 
20 or 25°C. 
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EXAMPLE II 



Labeling a target protein: Maltose Binding Protein 

Maltose binding protein (MBP, 42 kDa) was isolated as 
described in Example I above using the IMPACT procedure 
(IMPACT manual from New England Biolabs, Inc., Beverly, MA) 
in tiie presence of MESNA. 

A biotinylated peptide possessing an N-terminal 
cysteine (CDPEK*DS-GOOH (SEQ ID NO:9)), in which the biotin 
was attached to the e-amino group of the lysine residue) was 
ligated to the freshly purified target protein as described 
above. Briefly, 4 ^xL of biotinylated peptide (10 mM) were 
mixed with a 36 i^L aliquot of the freshly purified MBP sample. 
The mixture was incubated at 4°C overnight. 

Western blots with alkaline phosphatase linked anti- 
biotin antibody detected the presence of the ligated product 
but not the unligated target protein (Figure 7). The efficiency 
of the ligation is typically greater than 90% when MESNA is 
used for cleavage. 
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EXAMPLE III 

Labeling a target protein: Bst DNA Polymerase I Large 
Fragment (Bst Pol 1) 

Bst DNA Polymerase I large fragment (67 kDa) was 
isolated as described in Example I above using the IMPACT 
procedure (IMPACT manual from New England Biolabs, Inc., 
Beverly, MA) in the presence of MESNA. 

A biotinylated peptide possessing an N-terminal 
cysteine (CDPEK*DS-COOH (SEQ ID NO:9)), in which the biotin 
was attached to the e-amino group of the lysine residue) was 
ligated to the freshly purified target protein as described. 
Briefly, 4 ^iL of biotinylated peptide (10 mM) were mixed with 
a 36 \xL aliquot of the freshly purified Bst Pol 1 sample. The 
mixture was incubated at 4°C overnight. 

Western blots with alkaline phosphatase linked anti- 
biotin antibody detected the presence of the ligated product 
but not the unligated target protein (Figure 7). The efficiency 
of the ligation is typically greater than 90% when MESNA is 
used for cleavage. 
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EXAMPLE IV 
Labeling a target protein: Paramyosin 

Paramyosin (29 kDa) was isolated as described in 
Example I above using the IMPACT procedure (IMPACT manual 
from New England Biolabs, Inc., Beverly, MA) in the presence of 
MESNA. 

A biotinylated peptide possessing an N-terminal 
cysteine (CDPEK*DS-COOH (SEQ ID NO:9)), in which the biotin 
was attached to the f-amino group of the lysine residue) was 
ligated to the freshly purified target protein as described. 
Briefly, 4 ^iL of biotinylated peptide (10 mM) were mixed with 
a 36 |.iL aliquot of the freshly purified paramyosin sample. The 
mixture was incubated at 4°C overnight. 

Western blots with alkaline phosphatase linked anti- 
biotin antibody detected the presence of the ligated product 
but not the unligated target protein (Figure 7). The efficiency 
of the ligation is typically greater than 90% when MESNA is 
used for cleavage. 
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EXAMPLE V 

Labeling a target protein: E. coll Thioredoxin 

E. coW thioredoxin (12 kDa) was isolated as described in 
Example I above using the IMPACT procedure (IMPACT manual 
from New England Biolabs, Inc., Beverly. MA) in the presence 
ot MESNA. 

A biotinylated peptide possessing an N-terminal 
cysteine (CDPEK*DS-COOH (SEQ ID NO:9)), in which the biotin 
was attached to the c-amino group of the lysine residue) was 
ligated to the freshly purified target protein as described. 
Briefly, 4 of biotinylated peptide (10 mM) were mixed with 
a 36 aliquot of the freshly purified thioredoxin sample. The 
mixture was incubated at 4°C overnight. 

Western blots with alkaline phosphatase linked anti- 
biotin antibody detected the presence of the ligated product 
but not the unligated target protein (Figure 7). The efficiency 
of the ligation is typically greater than 90% when MESNA is 
used for cleavage. 
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EXAMPLE VI 



isolation of a cytotoxic protein: 

The ligation procedure of Example I was applied to the 
isolation of a potentially cytotoxic protein. An endonuclease 
from Haemophilus parainfluenzae {HpaV, Ito, et al., (1992) 
Nucleic Acids Res 20:705-709) was generated by ligating an 
inactive truncated form of the enzyme expressed in E. coli 
(ER2566 cells, New England Biolabs, Inc., Beverly, MA) with 
the missing amino acids that were synthesized chemically. 

The first 223 amino acids of Hpa\ (full length Hpa\ is 
254 amino acids) were fused in frame with the modified Mxe 
GyrA intein and the CBD. The 223 amino acid Hpa\ fragment 
was isolated as described for purification of thioester tagged 
proteins. The truncated Hpa\ displayed no detectable 
enzymatic activity. 
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A synthetic peptide representing the 31 amino acids 
needed to complete Hpal was ligated onto the 223 amino acid 
truncated form of Hpal by the method of Example I. 



• 
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Enzymatic Assay for Hpal: 

The activity of the fused Hpa\ was determined by its 
ability to digest Lambda DNA (New England Biolabs, Beverly, 
MA). Serial dilutions of ligated or truncated Hpa\, with the 
appropriate peptide added to 1 mM, were incubated with 1 /vg 
of Lambda DNA for 1 hour at 370C in a buffer of 20 mM Tris- 
acetate, pH 7.9, 10 mM magnesium acetate, 50 mM potassium 
acetate, 1 mM dithiothreitol, and 170 /vg/mL BSA (total 
volume 30 //L). Digestion reactions were visualized on 1% 
agarose gels permeated with ethidium bromide. One unit of 
Hpa I was defined as the amount of enzyme necessary to 
digest 1 /yg of Lambda DNA in one hour at 37oC. 
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The newly ligated Hpa\ had a specific activity of 0.5- 
1.5x106 units/mg which correlated well with the expected 
value of 1-2x106 units/mg for the full length enzyme. 
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WHAT IS CLAIMED IS: 

1. A method for fusing an expressed protein with a peptide, 
said method comprising the steps of: 

(a) generating at least one C-terminal thioester- 
tagged target protein; 

(b) generating at least one target peptide having a 
specified N-terminal; and 

(c) ligating said target peptide to said target protein. 

2. The method of claim 1, wherein said target protein is 
generated from a first plasmid comprising an intein 
having N-terminal cleavage activity. 

3. The method of claim 2, wherein said intein comprises an 
intein having a cysteine residue at the N-terminal of the 
intein. 



The method of claim 3, wherein said target protein is 
generated by thiol reagent-induced cleavage of said 
intein. 

The method of claim 4, wherein said thiol reagent is 
selected from the group consistmg of MESNA. thiophenoi, 
DTT, B-mercaptoethanol or derivatives thereof. 

A fusion protein produced by the method of any one of 
claims 1-5. 
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7. A cyclic protein produced by the method of claim 1. 

8. A modified intein comprising a mutant Mxe GyrA intein 
5 capable of thiol reagent-induced cleavage to produce a 

thioester at the C-terminal of an adjacent target 
protein. 

9. A method of generating a reactive thioester comprising 
10 contacting a thiol reagent selected from the group 

consisting essentially of MESNA, thiophenol, DTT, 3- 

mercaptoethanol or derivatives thereof with a precursor 
comprising a target protein and intein. 

15 10. A method for screening thiol reagents which cleave a 

target intein comprising the steps of: 

(a) isolating a precursor comprising a protein and a 
modified intein; 

(b) contacting a thiol reagent with the precursor of 

20 step (a); 

(c) determining whether a splicing or cleaving event 

occurs. 

11. The method of claim 10, compnsing the further step of 
25 determining wnetner ihe spliced or cleaved product of 

step (c) can ligate to a target peptide having an N- 
temrinal cytokine. 
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Figure 1 

Iniem Mediated Protein Ligation 

Step 1: N-S Shift Target 



Step 2: Thiol Mediated 
Cleavage 



Step 3: Peptide Attack 



Step 4: S-N Shift 



Chitin Binding 
Intein Domain 
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Figure 3. Direct Ligation Reaction 
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Figure 5 



o n 1 



i u ^ 



40 



S: GTTTATTTTT CTAAATACAT TCAAATATGT ATCCGCTCAT GAGACAATP-Ji 
101 CCCTGATAAA TGCTTCAATA ATATTGAAAA AGGAAGAGTA TGAGTATTCA 
151 ACATTTCCGT GTCGCCCTTA TTCCCTTTTT TGCGGCATTT TGCCTTCCTG 
TTTTTGCTCA CCCAGAAACG CTGGTGAAAG TAAAAGATGC TGAAGATCAG 
TTGGGTGCAC GAGTGGGTTA CATCGAACTG GATCTCAACA GCGGTAAGAT 
CCTTGAGAGT TTTCGCCCCG AAGA^XGTTC TCGAATGATG AGCACTTTTA 
AAGTTCTGCT ATGTGGCGCG GTATTATCCC GTGTTGACGC CGGGCAAGAG 
CAACTCGGTC GCCGCATACA. CTA.TTCTCAG AATGACTTGG TTGAGTACTC 
ACCAGTCACA ;;A.^J^GCA.TC TTACGGATGG GATr.^C7vGTA AGAGA^TTAT 
^^^^^P^^^^ '^AT.AACCA'^G AGTGATA/vCA CTGCC^GCCAA CTTAGTTCTG 
.^^ r^r^^r^r-r-T.r c AG^'^ ^ ^ GrTr^TTTGr ACAACATGGG 
^ ^^r-r^^r^rr^r^ n^f^ n-r^Qncz Ar^GGAGrTG AATGAAGGCA 
Si:: TACCAAACG7. CGAGCGTGAC ACGACGATGC CTGTAGCAAT GGGA.ACAACG 
701 TTGCGCAAAC TATT/xACTGG CGAAGTAGTT ACTCTAGCTT CCCGGCAACA 
7- ATTAATAGAC TGGATGGAGG CGGATAAAGT TGCAGGACCA CTTCTGCGCT 
801 CGGCCCTTCC GGGTGGCTGG TTTATTGCTG ATAA-.ATCTGG AGGCGGTGAG 
851 CGTGGGTCTC GCGGTATCAT TGCAGCACTG GGGCCAGATG GTA-AGCCCTC 
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901 CCGTATCGTA CTTATCTACA CGACGGGGAG TCAGGCAACT ATGGATGA.AC 
9E1 GAAATAGACA GATCGCTGAG ATAGGTGCCT CACTGATTAA GCATTGGTA^. 
:0L.: CTGTCAGACC A-AGT-TTACTC ATATATACTT TAGATTGATT TACCCCGGTT 
1051 GATAATCAGA AAAGCCCCA.A AAACAGGAAG ATTGTATAAG CAAATATTTA 
1101 .AA.TTGTAAAC GTTAATATTT TGTTAAAATT CGCGTTAAAT TTTTGTTAAA 
1151 TCAGCTCATT TTTTAACCAA TAGGCCGAAA TCGGCAAAAT CCCTTATAAA 

12 01 TCAAAAGAAT AGCCCGAGAT AGGGTTGAGT GTTGTTCCAG TrTCGAACAA 
1251 GAGTCCACTA TTAAAGAACG TGGACTCCAA CGTCAAAGGG CGAAAAACCG 

13 01 TCTATCAGGG CGATGGCCCA CTACGTGAAC CATCACCCAA ATCAAGTrTT 

13 51 irGGGGTCGA GGTGCCGTAA AGCACTAAJ^T CGGAACCCTA AAGGGAGCCC 

14 01 CCGATTTAGA GCTTGACGGG GAAAGCCGGC GAACGTGGCG AGAAAGGAAG 
14 51 GGAAGAAAGC GAAAGGAGCG GGCGCTAGGG CGCTGGCAAG TGTAGCGGTC 
1501 ACGCTGCGCG TAACCACCAC ACCCGCCGCG CTTA/\TGCGC CGCTACAGGG 
" "^.-.1 CGCGT/vA^AAG GATCTAGGTG ,\AGATCCTTT TTGATAATCT CJ'i :Gx-.-^'^ 
1601 ATCCC-TTAAC GTGAGTTTTC GTTCCACTGA GCGTCAGACC CCGTAGAAAA 
1651 GATCAAAGGA TCTTCTTGAG ATCCTTTTTT TCTGCGCGTA ATCTGCTGCT 
1701 TGCAAACAAA AAAACCACCG CTACCAGCGG TGGTTTGTTT GCCGGATCAA. 

17 51 GAGCTACCAA. CTCTTTTTCC GAAGGTAACT GGCTTCAGCA GAGCGCAGAT 

18 01 ACCAAATACT GTCCTTCTAG TGTAGCCGTA GTTAGGCCAC CACTTCA-AGA. 
185- /vCTCTGTAGC ACCGCCTACA TACCTCGCTC TGCTAATCCT GTTACCAGTG 
1901 GCTGCTGCCA GTGGCGATAA GTCGTGTCTT ACCGGGTTGG ACTCAAGACG 
1951 ATAGTTACCG GATAAGGCGC AGCGGTCGGC CTGAACGGGG GCTTCGTGCA 
20C: CACAGCCCAG CTTGG7.GCGA. ACGACCTACA CCGAACTGAG ATACCTACAG 
one: -nTGAGC-A- GAGA^AGCGC CACGCTTCCC GA^GGGAGAA AGGCGGACAG 

r.r.."r;.-:r-AGGG TCGGAACAGG AGAGCGCACG AGGGAGCTT'. 
2151 CAGGCGGA-A^ CnrrTGGTAT CTTTATAGTC CTGTCGGGTT TCGCCACCTC 
2201 TGACTTGAGC GTCGATTTTT GTGATGCTCG TCAGGGGGGC GGAGCCTATG 
22 51 GAAAAACGCC AGCAACGCGG CCTTTTTACG GTTCCTGGCC TTTTGCTGGC 

6/11 




PCT/TJS99/22776 



WO 00/18881 



CGTATTACCG CCTTTGAGTG AGCTGATACC GCTCGCCGCA GCCGAACGAC 
,4 0: CGAGCGCAGP GAGTCAGTGA GCGAGGAAGC TATGGTGCAC TCTCAGTACA 
24 51 ATCTGCTCTG ATGCCGCATA GTTAAGCCAG TATACACTCC GCTATCGCTA 
2501 CGTGACTGGG TCATGGCTGC GCCCGGACAC CCGCCAACAC CCGCTGACGC 
2551 GCCCTGACGG GCTTGTCTGC TCCCGGCATC CGCTTACAGA CAAGCTGTGA 
2601 CCGTCTCCGG GAGCTGCATG TGTCAGAGGT TTTCACCGTC ATCAGCGAAA 
2651 CGCGCGAGGC AGCTGCGGTA AAGCTCATCA GCGTGGTCGT GCAGCGATTC 
2701 ACAGATGTCT GCCTGTTCAT CCGCGTCCAG CTCGTTGAGT TTCTCCAGAA. 
2751 GCGTTAATGT CTGGCTTCTG ATAAAGCGGG CCATGTTAAG GGCGGTTTTT 
2801 TCCTGrrrGG TCACTTGATG CCTGCGTGTA AGGGGGAATT TCTGTTCATG 
2 851 GGGGTAATGA TACCGATGAA ACGAGAGAGG ATGCTCACGA TACGGGTTAC 
2901 TGATGATGAA CATGCCCGGT TACTGGAACG TTGTGAGGGT AAACAACTGG 

^ ^^^^^^^^.^ r-A^AriPJkAPJ^ TCACTCAGGG TCAATGCCac 

2 951 CGGTATGGAT GCGGc&or^^^ ^A^A^^^^- — 

crn-^r TCGGCCGCCA TGCCGGGG.A1 

3 001 ccgaACGCCA ^<^<-<—^^^~ ' 

3051 AATGGCCl^: TTCTCGCCGA A^CGTTTGGT GGCGGGACCA GTGACGAAGG 

3101 CTTGAGCGAG GGCGTGCAAG ATTCCGAATA CCGCAAGCGA CAGGCGGATC 

3151 ATCGTCGCGC TCCAGCGAAA GCGGTCCTCG CCGAAAATGA CCCAGAGCGC 

3201 TGCCGGCACC TGTCCTACGA GTTGCATGAT AAAGAAGACA GTCATAAGTG 

3251 CGGCGACGAT AGTCATGCCC CGCGCCCACC GGAAGGAGCT GACTGGGTTG 

33 01 AAGGCTCTCA AGGGCATCGG TCGAGATCCC GGTGCCTAA.T GAGTGAGCTA 

3351 ACTTACATTA ATTGCGTTGC GCTCACTGCC CGCTTTCCAG TCGGGAAACC 

3401 TGTCCTGCCA GCTGCATTAT. TG.A.ATCGGCC AACGCGCGGG GAGAGGCG.;T 

„^ ^^^^^rr^rm-r"^ TTTCACCAG"^^ GAGACGGGCA 

3 451 TTGCGTATTG GGCGCCAltv^^^ . ^-^^ - - - — - . . . ^ 

..r<^^nGrccT gagagagttg cagcaagggc; 

r~ -^-^^.'-r.z.i'.AT, -^rrTGTTTC;A tggtggtt^^'. 

, ^r^^r"v^rar.T ATCGTCGTAT CCCACTACCG 
3601 CGGCGGG/^TA x n.^'^^- G.^u _ - 

3651 AGATATCCGC ACCAACGCGC AGCCCGGACT CGGTAA.TGGC GCGCATTGCG 
3701 CCGAGCGCCA TCTGATCGTT GGCA..CCAGC ATCGGAGTGC GAACGATGCC 
. ,,_„„j,p.,.p _~,^cCATGG -T-nTTG/^--- .v. ■ : 
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38 01 CGCCTTCCCG TTCCGCTATC l,O^TGAAT1 I U.^.i^^^ 

^ ^^^^^r-./'-nr^ lii'^j. i^^i ^i^'^TA ATGGGCCCG*. 
U,-,: TGCCAGCCAG CCAGACGCAG A^-GCU^^^*-" rt^"- 

3 90 1 TAA.CAGCGCG ATTTGCTGGT GACCCAATGC GACCAGATGC TCCACGCCCA 

39 51 GTCGCGTACC GTCTTCATGG GAGAAAATAA TACTGTTGAT GGGTGTCTGG 
4001 TCAGAGACAT CAAGAAATAA CGCCGGAACA TTACTGCAGG CAGCTTCCAC 
405: AGCAATGGCA TCCTGGTCAT CCAGCGGATA GTTAATGA.TC AGCCCACTGA 
4101 CGCGrrGCGC GAGAAGATTG TGCACCGCCG CrrTACAGGC TTCGACGCCG 
4151 CTTCGTTCTA CCATCGACAC CACCAGGCTG GCACCCAGTT GATCGGCGCG 
42C1 AGArriAATC GCCGCGACAA TTTGCGACGG CGCGTGCAGG GCCAGACTGG 
4251 AGGTGGCAAC GCCAATCAGC AACGACTGTT TGCCCGCCAG TTGTTGTGCC 
4301 ACGCGGTTGG GAATGTAATT CAGCTCCGCC ATCGCCGCTT CCACTTTTTC 

4351 ccGCG-rrrrc gcagaaacgt ggctggcctg gttcaccacg cgggaaacgg 

„^^,^..r^> CA^^C^GGrA TACTCTGCGA CATCGT/^^^--- 

4501 ACCGCGAAAG GTTTTGCGCC ATTCGATGGT GTCCCGGATC TCGACGCTCT 

4551 CCCTTATGCG ACTCCTGCAT TAGGAAGCAG CCCAGTAGTA GGTTGAGGCC 

4601 GTTGAGCACC GCCGCCGCAA GGAATGGTGC ATGCCGCCCT TTCGTCTTCA 

4651 AGAATTAATT CCCAA.TTCCA GGCATCAAAT AAAACGA.AAG GCTCAGTCGA 

47 01 AAGACTGGGC CTTTCGTTTT ATCTGTTGTT TGTCGGTGAA CGCTCTCCTG 

. ^^■vrrnrraC.C. AGCGGATTTG AACGTTGCGA AGCAACGGCC 

4 801 CGGAGGGTGG CGGGCAGGAC GCCCGCCATA AACTGCCAGG AA.TTAATTCC 

48 51 AGGCATCAA7. TAAA^.CGAAA GGCTCAGTCC AAAGACTGGG CCTTTCGTTT 

„^„^,3CTaA ACGCTCTCrT GAGTAGGACA .AA.TCCGCCGG 

■ . . .^r^rr^r-r-- -^^PAArGG^ ^XGGAfX;GTG GCGGGCAGGA 

_^ ^...^ - - - . ■ ■ 

,^ _ - ^rr^T, ^^ rr.'p,'- T A H f t G C AJ^ AT AAAuAC G A.^- 

_^^^«^,T^r^ rr.^^V'rr^r:^'^^^ TTTGTGGGTG 

^/.CTG G/^^^.AG Arron GCCTTTCGT i . T/-. i L . b . . ^ ^ - ^ 
.V\GGCTGTCr T^GAGTAG^.AC AAA.TCCGCGG GGAGCGGATT TG.V.CGTTG' 



5 U ::; ^ /^ovjrv 
:> 1 0 1 
5151 



GAAGCAAGGG GGGGGAGGGT GGCGGGGAGG AGGGGGGCGA TAAAGTGGGA 
nC^AATTAAl^^ --AGGCA-V. AATAAAACG. AAGGGTCA."^ CGAAAGAG^G 
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52j". ggcctttcgt tttatctg 



^_^^^^,,CGGT GAACGCTCTC CTGAGTAGG; 



53C: CA.VATCCGCC GGGAGCGGAT rTGAACGTTG CGAAGCAACG GCCCGGAGGG 

. , , .^r^r-r-r-r :.T ^^rTGCC AGGAATTGGG GATCGGAATT 

540. AATTCCCGGT TTAA-ACCGGG GATCTCGATC CCGCGAAATT aatacgactc 

64 5: ACTATAGGGG A.ATTGTGAGC GGATA.AGAAT TCCCCTCTAG AAATAATTTT 

55 01 GTTTAACTTT AAGAAGGAGA TATAcatatg gctagctcgc gagtcgacgg 

5551 cggccgcctc gagggctctt ccTGCATCAC GGGAGATGCA CTAGTTGCCC 

5601 TACCCGAGGG CGAGTCGGTA CGCATCGCCG ACATCGTGCC GGGTGCGCGG 

5651 CCCAACAGTG ACAACGCCAT CGACCTGAAA GTCCTTGACC GGCATGGCAA 

5701 TCCCGTGCTC GCCGACCGGC TGTTCCACTC CGGCGAGCAT CCGGTGTACA 

5751 CGGTGCGTAC GGTCGAAGGT CTGCGTGTGA CGGGCACCGC GAACCACCCG 

5801 TTGTTGTGTT TGGTCGACGT CGCCGGGGTG CCGACCCTGC TGTGGAAGCT 

5851 GATCGACGA/. ATC.AAGCCGG GCGATTACGC GGTGATTCA/. CGCAGCGCAT 

5 951 ACAACCTACA CAGTCGGCGT CCCTGGACTG GTGCGTTTCT TGGAAGCACA 
6001 CCACCGAGAC CCGGACGCGC AAGCTATCGC CGACGAGCTG ACCGACGGGC 

6 051 GGTTCTACTA CGCGAAAGTC GCCAGTGTCA CCGACGCCGG CGTGCAGCCG 
6101 GTGTATAGCC TTCGTGTCGA. CACGGCAGAC CACGCGTTTA TCACGAACGG 
6151 GTTCGTCAGC CACGCTACTG GCCTCACCGG TCTGAACTCA GGCCTCACGA 
b"0- -AA^'-CrTGG TGTATCCGCT TGGCAGGTCA ACACAGCTTA TACTGCGGGA 
62 51 CA.ATTGGTCA CATATAACGG CAAGACGTAT AAATGTTTGC AGCCCCACAC 

^^^^r-r-TA- "-~^^>irr'^ T'r-c-TGf'f^T'TG TGGCAGCTTC 
6301 CTCCTTGGCA GGATGGGAAl '^.-^ ; u-^/^CG . 

acaaggGGAT rCGGCTGCTA ACAAAGCCCG AAAGGAAGCT 
,.,^„„r:r.-Tr; — ACv-G" TGAvGCAAT.V-. CTAGCATAAr CCCTTGGGGC 
„„, — T-r:AGf"'- rT'^'^TTTGC'l GAAAGGAGGA. ACTATATCu ' j 
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Figure 

Hpa I ligation 
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